LOCUS       ECNRFA                  7320 bp    DNA     linear   BCT 06-JUN-1995
DEFINITION  E.coli nrfA gene.
ACCESSION   X72298
VERSION     X72298.1  GI:404302
KEYWORDS    cytochrome c-552; nitrite reductase; nrfA gene.
SOURCE      Escherichia coli.
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 7320)
  AUTHORS   Darwin,A., Hussain,H., Griffiths,L., Grove,J., Sambongi,Y.,
            Busby,S. and Cole,J.
  TITLE     Regulation and sequence of the structural gene for cytochrome c552
            from Escherichia coli: not a hexahaem but a 50 kDa tetrahaem
iitiii I Hill I ii II I III Mil I Mill II nil

TTACTTCGACGGCAAAAACAAAGCGGTTAAATTCCCGTGGGATGACGGCATGAAAGTCGA 1009

TCAGATCATGGCCTACTACGACGAGGACGGACACTCCGACTGGACGCACGAGCTCACGGG 893

III I II Mini I I Ml IIMIIII II M II 

AAATATGGAGCAGTATTACGACAAAATTGCCTTCTCTGACTGGACTAACTCCCTGTCGAA 1069



CGCCAAGGTGCTGAAG 

I mint 

AACGCCAATGCTGAAA 



GCGCAGCACCCtGAGTTCGAGATGTACAACCAGGGCATCCACGC 953

niiiiiiii[M iiiii nil 

GCGCAGCACCCGGAATATGAAACCTGGACAGCGGGCATTCACGG 1129



GAAGAGCGGCGTGGCCTGCGCGGACTGCCACATGCCGTTCATGCGCGAGGGGGCGATGAA 1013 

II I I I II I II I I I I I 1! I t I I n I I Ml r I I II 

TAAAAACAACGTGACCTGTATCGACTGCCATATGCCAAAAGTGCAGAACGCCGAAGGCAA 1189

GGT---CAGCGACCACCAGGTGCGCAGCCCGCTGCTGAACATCAACCGCGCGTGCCAGAC 1070

I II llllll I I II Mil IIIII II ill M 

ACTCTACACCGACCATAAAATTGGTAATCCGTTTGATAACTTCGCCCAGACTTGTGCGAA 1249

GTGCCACAAGTGGAGCGAGGCGGAGCTGCTCCAGCGCGCGGAGACCATCCAGACGCGCAC 1130

It I II I llllll III I III Ml I 

CTGCCATACCCAGGACAAAGCTGCCTTGCAAAAAGTGGTCGCGGAACGTAAGCAGTCGAT 1309 



Ml I Mil III IIIII M I 11 III 



1369 



I I It I I II tl I I II I t I I M I 



CGCTCAATTCTACCTGGACTTCGTGGAGGCGGAGAACTCCATGGGCTTCCACGCGGACCA 1310

nil) I III Mi II II I M l lllll I 

TGCCCAGTGGCGCTGGGATCTGGCGATCGCTTCCCACGGCATTCATATGCACGCACCGGA 1489

GGAGGCGGTGCGCATCCTGAGCAACTCCAT 1340

III I I I I I I I M til 
AGAAGGTTTACGGATGCTCGGTACGGCGAT 1519
LOCUS       ECNRFA                  7320 bp    DNA     linear   BCT 06-JUN-1995
DEFINITION  E.coli nrfA gene.
ACCESSION   X72298
VERSION     X72298.1  GI:404302
KEYWORDS    cytochrome c-552; nitrite reductase; nrfA gene.
SOURCE      Escherichia coli.
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 7320)
  AUTHORS   Darwin,A., Hussain,H., Griffiths,L., Grove,J., Sambongi,Y.,
            Busby,S. and Cole,J.
  TITLE     Regulation and sequence of the structural gene for cytochrome c552
            from Escherichia coli: not a hexahaem but a 50 kDa tetrahaem
            nitrite reductase
  JOURNAL   Mol. Microbiol. 9 (6), 1255-1265 (1993)
  MEDLINE   95020657
   PUBMED   7934939
REFERENCE   2  (bases 1 to 7320)
  AUTHORS   Hussain,H.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-MAY-1993) H.A. Hussain, University of Birmingham,
            School of Biochemistry, Edgbaston, Birmingham, B15 2TT, UK
  REMARK    revised by [3]
REFERENCE   3  (bases 1 to 7320)
  AUTHORS   Hussain,H.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-SEP-1993) H.A. Hussain, University of Birmingham,
            School of Biochemistry, Edgbaston, Birmingham, B15 2TT, UK
COMMENT     On Sep 30, 1993 this sequence version replaced gi:312132.
FEATURES             Location/Qualifiers
     source          1..7320
                     /organism="Escherichia coli"
                     /db_xref="taxon:562"
                     /map="4367-4374 KOHARA MAP"
     gene            303..1739
                     /gene="nrfA"
     CDS             303..1739
                     /gene="nrfA"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAA51048.1"
                     /db_xref="GI:853826"
177 IleLysGlyPheGluLysMetAsnGlnMetProPheMetGluAlaArg--- 192

693 GCGCGTCTGATCCAGAAAGACGGCGAAGATGGCTACTTCCACGGTAAATGGGCGCGCGGC 752 

193 - LysLeuValGluHisProValSerCysIleAspCysHisAspProThrThr"- 209

::: IM III Mil = 

753 GGTCCGGAAATCGTCAACAAC---TTAGGTTGTGCCGATTGCCATAACACCGCCTCTCCA 809

210 - - ---MetGlnLeuArgValThrArgProGlyPheIleGluGlyIle 223

:::||1 ::::::|||lll = = = 

810 GAGTTCGCCAAAGGCAAACCGGAGTTAACCCTTTCCCGTCCGTATGCGGCTCGCGCGATG 869 

224 AlaAlaLeuLysAlaSerGlnGlyValProAsnPheArgValAsnGlnAspAlaThrArg 243

I I I - • • III III ' ill III 

870 GAAGCCATT GGTAAACCT TTTGAGAAAGCCGGACGT 905 

244 GlnGluMetArgThrTyrlaiCysGlyGlnCysHisValGluTyrTyrPhiysGlyLys 263

::: :::::: j 111 I I I 1 I I I 1 I II I I I I I I I I I I I I I MHII 

906 TTCGACCAGCAATCGATGiTTTGCGGTCAGTGCCATGTGGAGTATTACTTCGACGGCAAA 965

264 GluLysArgLeuThrTyrProTrpAlaLysGlyIleAsnIleAspGlnIleMetAlaTyr 283

III • • ' : : : i I I I I I ill::: :::::: : : : III 

966 AACAAAGCGGTTAAATTCCCGTGGGATGACGGCATGAAAGTCGAAAATATGGAGCAGTAT 1025 

284 TyrAspGluAspGlyHisSerAspTrpThrHisGluLeuThrGlyAlaLysValLeuLys 303
I I nil::: 111111111111::: |1|::: " = UJ, 111

1026 TACGACAAAATTGCCTTCTCTGACTGGACTAACTCCCTGTCGAAAACGCCAATGCTGAAA 1085
304 AlaGlnHisProGluPheGluMetTyrAsnGlnGlyIleHisAlaLysSerGlyValAla 323



1086 GCGCAGCACCCGGAATATGAAACCTGGACAGCGGGCATTCACGGTAAAAACAACGTGACC 1145
324 CysAlaAspCysHisMetProPheMetArgGluGlyAlaMetLysVal---SerAspHis 342

III I II I II II t I I II I I :::::: III::: : : : II I I I I 

1146 TGTATCGACTGCCATATGCCAAAAGTGCAGAACGCCGAAGGCAAACTCTACACCGACCAT 1205

III ::: IIIIIIIM Ml:- IN



343 GlnValArgSerProLeuLeuAsnIleAsnArgAlaCysGlnThrCysHisLysTrpSer 362

: : : I I I III : : : III 1 I I M I

1206 AAAATTGGTAATCCGTTTGATAACTTCGCCCAGACTTGTGCGAACTGCCATACCCAGGAC 1265

363 GluAlaGluLeu LeuGlnArgAlaGluThrIleGlnThr ArgThrPhe 378

.-•III III : : : I I I :::::: I I I

1266 AAAGCTGCCTTGCAAAAAGTGGTCGCGGAACGTAAGCAGTCGATTAACGACCTGAAAATC 1325

379 GluThrArgAsnIleAlaMetAspAlaLeuValAspLeuIleHisAspIleGluAlaAla 398
III llllll :::IIMII
1326 AAGGTTGAA GATCAACTGGTTCACGCTCACTTCGAAGCGAAAGCAGCG 1373

399 GlnLysAlaGlyGlnSerGluGluAlaLeuAlaLysAlaArgAspLeuGlnLysArgAla 418

llllll : 1 I I : : : : : : I I I ■ 111

1374 CTGGATGCAGGCGCGACGGAAGCTGAAATGAAGCCAATTCAGGACGATATCCGTCATGCC 1433

419 GlnPheTyrLeuAspPheValGluAlaGluAsnSerMetGlyPheHisAlaAspGlnGlu 438
lit... 1)1 III : : : : : : llllll : : : I I I

1434 CAGTGGCGCTGGGATCTGGCGATCGCTTCCCACGGCATTCATATGCACGCACCGGAAGAA 1493

439 AlaValArgIleLeuSerAsnSerIleAsnPheSerArgLeuGlyGlnAsnAlaLeuArg 458
... I I ] ... I I 1 III:::
1494 GGTTTACGGATGCTCGGT ACGGCGATGGAT 1523

459 ProSerGlyGlyAlaSerThrSerProThrThrArgProGlnGlyAlaProAla 476

1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 i 1 1 1 1 1

1524 AA-AGCGGCGGATGC - - ACGCACCAAACTGGCGCGCCTGCT 1561



